26/03/2020
Updates
Examination Part I: Timeline of take-home exercises
- Examination handed out via GitHub (Classroom): 7 May 2020
- Deadline to hand in results: 8 June 2020 (16:00)
Format of take-home exercises
- GitHub classroom group assignment.
- Basic starter code handed out as repository.
- A data analytics project based on a large data set, including the entire data pipeline.
- Tasks
- Instructions in README
- Improve efficiency of given code
- Extend code: complete specific tasks
- Explain/document procedure (conceptual understanding)
- ‘Product’: the repository, including R code, and a report in R markdown.
Examination Part II: Group Projects/Presentations
- Groups formed decentrally (same groups as for take-home exercises).
- Own research question, find a data set, think of approach/strategy, implement in R, presentation of results as Rmd/R-presentation recorded in a ‘screencast’.
- Hand in screencast via Canvas/Studynet (assignment is already open), commit code/rmd to GitHub-classroom (initial group formation assignment).
Register in GitHub Classroom
- By the end of the month, teams must be set!
- Please register, if you have not done so yet and join your team in GitHub Classroom!
- Still problems finding a team? Use the Q&A Section in Canvas! In case of emergencies, email me: ulrich.matter@unisg.ch
Recap Week 5
Beyond memory
- RAM is not sufficient to handle the amount of data to be analyzed…
- What to do?
- Scale up by using parts of the available Mass Storage (hard-disk) as virtual memory
Virtual memory
